Optimal Clustering and Non-uniform Allocation of Gaussian Kernels in Scalar Dimension for HMM Compression
Abstract
We propose an algorithm for optimal clustering and non-uniform allocation of Gaussian kernels in the scalar (feature) dimension to compress complex, Gaussian-mixture-based, continuous-density HMMs into computationally efficient, small-footprint models. The symmetric Kullback-Leibler divergence (KLD) is used as the universal distortion measure and is minimized in both the kernel clustering and the kernel allocation procedures. The algorithm was tested on the Resource Management (RM) database. The original context-dependent HMMs can be compressed to any resolution, measured by the total number of clustered scalar kernel components. Good trade-offs between recognition performance and model complexity were obtained: the HMMs can be compressed to 15-20% of the original model size, requiring only 1-5% of the multiplication/division operations, with almost negligible degradation in recognition performance.
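As an illustration of the distortion measure named in the abstract, the following is a minimal sketch of the symmetric KLD between two scalar (one-dimensional) Gaussian kernels. It assumes the symmetric form is the sum of the two directed divergences; the abstract does not state whether the sum or the average is used, and the function names `kl_gauss` and `symmetric_kld` are hypothetical, not taken from the paper.

```python
import math


def kl_gauss(mu_p, var_p, mu_q, var_q):
    """Directed KL divergence KL(p || q) between two scalar Gaussians.

    p = N(mu_p, var_p), q = N(mu_q, var_q); variances must be positive.
    """
    return 0.5 * (
        math.log(var_q / var_p)
        + (var_p + (mu_p - mu_q) ** 2) / var_q
        - 1.0
    )


def symmetric_kld(mu_p, var_p, mu_q, var_q):
    """Symmetric KLD, assumed here to be KL(p || q) + KL(q || p)."""
    return kl_gauss(mu_p, var_p, mu_q, var_q) + kl_gauss(mu_q, var_q, mu_p, var_p)


# Identical kernels have zero distortion; the measure grows as the
# means separate or the variances diverge.
d_same = symmetric_kld(0.0, 1.0, 0.0, 1.0)
d_shift = symmetric_kld(0.0, 1.0, 1.0, 1.0)
```

Because the log terms cancel in the sum, the symmetric form depends only on the variance ratios and the squared mean difference, which keeps it cheap to evaluate for every pair of scalar kernels during clustering.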
Similar Resources
Non-uniform Kernel Allocation Based Parsimonious HMM
In a conventional Gaussian-mixture-based hidden Markov model (HMM), all states are usually modeled with a uniform, fixed number of Gaussian kernels. In this paper, we propose to allocate kernels non-uniformly to construct a more parsimonious HMM. Different numbers of Gaussian kernels are allocated to states in a non-uniform and parsimonious way so as to optimize the Minimum Description Length (MDL)...
Training Discriminative HMM by Optimal Allocation of Gaussian Kernels
We propose to train a hidden Markov model (HMM) by allocating Gaussian kernels non-uniformly across states so as to optimize a selected discriminative training criterion. The optimal kernel allocation problem is first formulated based upon a non-discriminative, Maximum Likelihood (ML) criterion and then generalized to incorporate discriminative ones. An effective kernel exchange algorithm is deri...
On the Optimal Allocation of Bandwidth for Source Coding, Channel Coding and Spreading in a Coherent DS-CDMA System Employing an MMSE Receiver
This paper analyzes a CDMA system with a continuous uniform source, a scalar quantizer, and a hard decision block channel coder, under the constraint of fixed total bandwidth. The system employs a linear, minimum mean-square-error (MMSE) receiver for interference suppression. We study the performance for both additive white Gaussian noise (AWGN) and flat Rayleigh fading channels. The a...
Speech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a maximum a posteriori (MAP) estimator based on a Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
Suboptimality of the Karhunen-Loeve transform for fixed-rate transform coding
An open problem in source coding theory has been whether the Karhunen-Loève transform (KLT) is optimal for a system that orthogonally transforms a vector source, scalar quantizes the components of the transformed vector using optimal bit allocation, and then inverse transforms the vector. Huang and Schultheiss proved in 1963 that for a Gaussian source the KLT is mean squared optimal in ...